Key Expression driven Record Mining for Event Calendar Search

نویسندگان

  • Yeong Su Lee
  • Michaela Geierhos
چکیده

This paper presents an approach to extract data records from websites, particularly ones with event calendars. We therefore use languagespecific key expressions and HTML patterns to recognize every single event given on the investigated web page. One of the most remarkable advantages of our method is that it does not require any additional classification steps based on machine learning algorithms or keyword extraction methods; it is a so-called one-step mining technique. Our experimental results obtained on German opera websites show excellent results in precision and recall. Furthermore, we could demonstrate that our proposed technique outperforms other data record mining applications run on event sites.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data-Driven Approaches to Improve the Quality of Clinical Processes: A Systematic Review

Background: Considering the emergence of electronic health records and their related technologies, an increasing attention is paid to data driven approaches like machine learning, data mining, and process mining. The aim of this paper was to identify and classify these approaches to enhance the quality of clinical processes. Methods: In order to determine the knowledge related to the research ...

متن کامل

Mining Program Workflow from Interleaved Logs

Successful software maintenance is becoming increasingly critical due to the increasing dependence of our society and economy on software systems. One key problem of software maintenance is the difficulty in understanding the evolving software systems. Program workflows can help system operators and administrators to understand system behaviors and verify system executions so as to greatly faci...

متن کامل

Event-driven and Attribute-driven Robustness

Over five decades have passed since the first wave of robust optimization studies conducted by Soyster and Falk. It is outstanding that real-life applications of robust optimization are still swept aside; there is much more potential for investigating the exact nature of uncertainties to obtain intelligent robust models. For this purpose, in this study, we investigate a more refined description...

متن کامل

Continuous and incremental data mining association rules using frame metadata model

Most organizations have large databases that contain a wealth of potentially accessible information. The unlimited growth of data will inevitably lead to a situation in which it is increasingly difficult to access the desired information. There is a need to extract knowledge from data by Knowledge Discovery in Database. Data mining is the discovery stage of KDD whereas association rule is a pos...

متن کامل

Duration of exclusive breastfeeding; validity of retrospective assessment at nine months of age

BACKGROUND In cross sectional, case control and retrospective cohort studies, duration of Exclusive Breastfeeding (EBF) usually depends on maternal recall. Retrospective data are often subjected to recall bias and could lead to a potential for exposure misclassification. The purpose of the present paper is to assess the validity of maternal recall of EBF duration during infancy, after cessation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009